SICS: Valence annotation based on seeds in word space

Authors

  • Magnus Sahlgren
  • Jussi Karlgren
  • Gunnar Eriksson
Abstract

This paper reports on an experiment to identify the emotional loading (the “valence”) of news headlines. The experiment uses a resource-thrifty approach to valence annotation based on a word-space model and a set of seed words. The model was trained on newsprint, and valence was computed using proximity to one of two manually defined points in a high-dimensional word space: one representing positive valence, the other representing negative valence. Each headline was projected into this space, and its valence was taken to be the similarity score to whichever of the two points lay closer to the headline. The experiment provided results with high recall of negative or positive headlines. These results show that working without a high-coverage lexicon is a viable approach to content analysis of textual data.
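The procedure described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the three-dimensional toy vectors, the seed words, the averaging used to project a headline, the cosine similarity measure, and the sign convention (negative score when the negative point is closer) are all assumptions made for illustration. The real system used a word space trained on newsprint.

```python
import math

# Hypothetical toy word space: hand-made 3-dimensional vectors.
# A real word space would be trained on a corpus and have many more dimensions.
WORD_SPACE = {
    "win":   [0.9, 0.1, 0.2],
    "peace": [0.8, 0.2, 0.1],
    "joy":   [0.9, 0.0, 0.3],
    "war":   [0.1, 0.9, 0.2],
    "death": [0.0, 0.8, 0.3],
    "crash": [0.2, 0.9, 0.1],
    "team":  [0.5, 0.4, 0.8],
    "plane": [0.3, 0.5, 0.9],
}

# Hypothetical seed words defining the two manually chosen points.
POSITIVE_SEEDS = ["joy", "peace"]
NEGATIVE_SEEDS = ["war", "death"]


def centroid(words):
    """Average the vectors of all known words; return None if none are known."""
    vectors = [WORD_SPACE[w] for w in words if w in WORD_SPACE]
    if not vectors:
        return None
    dims = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dims)]


def cosine(a, b):
    """Cosine similarity between two vectors (0.0 if either has zero length)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def valence(headline):
    """Score a headline by its similarity to the closer of the two seed points.

    Assumed sign convention: positive if the positive point is closer,
    negative otherwise. Headlines with no known words score 0.0.
    """
    vec = centroid(headline.lower().split())
    if vec is None:
        return 0.0
    pos = cosine(vec, centroid(POSITIVE_SEEDS))
    neg = cosine(vec, centroid(NEGATIVE_SEEDS))
    return pos if pos >= neg else -neg


if __name__ == "__main__":
    for text in ["Team win", "Plane crash", "Unrelated words"]:
        print(text, round(valence(text), 3))
```

Because the score is always the similarity to the nearer point, almost every headline receives a clearly positive or negative value, which fits the high recall reported in the abstract.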

Similar resources

An annotation scheme for Persian based on Autonomous Phrases Theory and Universal Dependencies

A treebank is a corpus with linguistic annotations above the level of the parts of speech. During the first half of the present decade, three treebanks have been developed for Persian either originally or subsequently based on dependency grammar: Persian Treebank (PerTreeBank), Persian Syntactic Dependency Treebank, and Uppsala Persian Dependency Treebank (UPDT). The syntactic analysis of a sen...

Semantic Propagation on Contextonyms using SentiWordNet

Sentiment analysis and affect detection algorithms are generally based on annotated data, structured into dictionaries, ontologies or word nets. The focus, so far, has been concentrated on manual annotation of the data, and then, in some situations, a semantic valence propagation is applied. The problem with this approach is that while it is able to build new affective labels through the prop...

UPAR7: A knowledge-based system for headline sentiment tagging

For the Affective Text task at SemEval2007, University Paris 7’s system first evaluates emotion and valence on all words of a news headline (using enriched versions of SentiWordNet and a subset of WordNetAffect). We use a parser to find the head word, considering that it has a major importance. We also detect contrasts (between positive and negative words) that shift valence. Our knowledge-base...

Similarity Invariants for 3D Space Curve Matching

An invariant representation based on so-called similarity-invariant coordinate system (SICS) is presented for matching 3D space curves under the group of similarity transformations. In the SICS, the 3D geometry of a curve segment is unique. Thus, constraints on the curve can be fully explored for the matching. Experimental results with simulated data are presented.

Tags Re-ranking Using Multi-level Features in Automatic Image Annotation

Automatic image annotation is a process in which computer systems automatically assign textual tags related to the visual content of a query image. In most cases, inappropriate tags generated by users, as well as images without any tags, are among the challenges in this field and have a negative effect on the query's result. In this paper, a new method is presented for automatic image...

Publication date: 2007